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Random if-satisfiability (iC-SAT) is a model system for studying typical-case complexity of com- 
binatorial optimization. Recent theoretical and simulation work revealed that the solution space of 
a random A"-SAT formula has very rich structures, including the emergence of solution communi- 
ties within single solution clusters. In this paper we investigate the influence of the solution space 
landscape to a simple stochastic local search process SEQSAT, which satisfies a iC'-SAT formula in a 
sequential manner. Before satisfying each newly added clause, SEQSAT walk randomly by single-spin 
flips in a solution cluster of the old subformula. This search process is efficient when the constraint 
density a of the satisfied subformula is less than certain value acm\ however it slows down consid- 
erably as Q > Qcm and finally reaches a jammed state at a ~ Oj. The glassy dynamical behavior 
of SEQSAT for a > acm probably is due to the entropic trapping of various communities in the 
solution cluster of the satisfied subformula. For random 3-SAT, the jamming transition point Uj is 
larger than the solution space clustering transition point ad, and its value can be predicted by a 
long-range frustration mean-field theory. For random JC'-SAT with K > 4, however, our simulation 
results indicate that Qj = ad- The relevance of this work for understanding the dynamic properties 
of glassy systems is also discussed. 

PACS numbers: 89.70.Eg, 64.70.qj, 02. 10. Ox 

I. INTRODUCTION 

The random fC-satisfiability (isT-SAT) problem asks to determine whether or not a Boolean function with M clauses 
(each of which applying a constraint to K randomly chosen variables from a set of N Boolean variables) can be 
evaluated to be true. This problem has been studied extensively by computer scientists in the context of typical-case 
computation complexity of NP-complcte combinatorial satisfaction and optimization [1]. Statistical physics has played 
an important role in understanding the energy landscape of the random i^T-SAT problem, especially the (zero-energy) 
solution space structure of a satisfiable random K-SAT formula [U [31 HI [51 [5] . It was revealed that , as the constraint 
density a = M/N of a random K-SAT formula increases from zero, the solution space of the formula experiences 
a series of phase transitions before it finally becomes unsatisfiable as a exceeds a threshold value a^. Two very 
important phase transitions in this series are the clustering or dynamic transition at a = (where the solution space 
splits into exponentially many dominating Gibbs states) and the condensation transition a,t a = ac > ad (where the 
number of dominating solution Gibbs states becomes sub-exponential) . Statistical physics has also contributed a very 
efficient solver, survey-propagation (SP) [71[S], for the random K-SAT problem. SP is able to find solutions for large 
random 3-SAT formulas with millions of variables even when the constraint density of the formulas approach closely 
the satisfiability threshold as- The magic power of SP is not yet fully understood. These and other theoretical and 
algorithmic achievements make the interface between theoretical computer science and statistical physics a prosperous 
research area p,ITD]. 

As its energy landscape and solution space structure both are very complex, the random X-SAT problem may serve 
as an interesting model system for studying glassy dynamics. For example, similar to lattice glass models [11' and 
spin-facilitated kinetic Ising models \12\ 113] , one may apply a random external field to each variable of a random K- 
SAT formula and investigate how the configuration of the variables changes with time under the hard constraint of the 
clauses (which are not allowed to violate during the configuration evolution process). For a single-spin-flip dynamical 
process, the configuration of the system is confined to a connected component (a single solution cluster [14]) of the 
solution space of the iiT-SAT formula. However, the dynamics within such a solution cluster is not necessarily trivial. 
Simulations performed on random 3-SAT and 4-SAT formulas revealed that, as the constraint density a becomes 
relatively large, the structure of a single solution cluster of the formula is no longer homogeneous |15] . The solutions 
may aggregate into many communities, each of which containing a group of solutions that are densely connected 
to each other while the connections between two different communities are relatively sparse. In a community-rich 
solution cluster, a local search process will get trapped into a solution community for some time before it passes 
through a bridge region and enters into another different community. Such an entropic trapping effect will result in 
multiple time scales in the relaxation dynamics, which are typical of glassy systems. 

In this work we study the dynamics of a simple process SEQSAT which performs an unbiased random walk in a 
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solution cluster of a random if -SAT subformula Fm- The subformula F,„ contains the first m clauses {m — initially) 
of a long random K-SAT formula F of N variables. As soon as SEQSAT reaches a solution of Fm that also satisfies 
the (to + l)-th clause of formula F, this clause is added to subformula Fm and SEQSAT starts walking randomly in the 
solution cluster of the enlarged subformula F^+i again. The waiting time needed for SEQSAT to find a solution that 
satisfies the (to + l)-th clause gives a measure of the viscosity in moving in the solution cluster of the old subformula 
of to clauses. We find that, when the constraint density a = to/TV of the satisfied subformula Fm is small, the waiting 
time to satisfy the next clause is usually small. However the average value of the waiting time increases considerably 
as a exceeds a threshold value acm', and finally when a reaches a larger threshold value aj, the average waiting time 
essentially diverges and SEQSAT stops to satisfy the next clause. The dramatic slowing down of SEQSAT at a > acm is 
understood in terms of the complex community structures in the explored single solution clusters of the subformula 
Fm- We also calculate the mean value of the jamming transition point aj by a long-range frustration mean-field 
theory |16l I17j. and find that the theoretical prediction is in good agreement with simulation results in the case of 
random 3-SAT. For K = 3, aj is larger than the solution space clustering transition point ad, while for K > 4 it 
appears that aj coincides with ad- 

Recently the idea of constructing solutions for a constraint satisfaction formula by adding constraints one after 
another was explored by Krzakala and Kurchan [T5]. The present work differs from Ref. [TH] in that the SEQSAT 
process never allows the energy of the system to increase, while the WALKCOL algorithm used in Ref. [18 for the 
random Q-coloring problem is able to cross energy barriers. The SEQSAT process is also different from the CHAINSAT 
process of Alava and co-authors [Hj. Although CHAINSAT also prohibits any energy increases, it may satisfy a clause 
TO at the price of unsatisfying a clause to' that was previously satisfied. In the SEQSAT process, however, a satisfied 
clause will remain to be satisfied. The performance of SEQSAT may be further improved if such a hard constraint can 
be made more softer by introducing a positive temperature parameter. In this paper we also study the performance 
of a biased random walk search process. 



II. THE RANDOM 7f-SATISFIABILITY PROBLEM 



A K-SAT formula F contains N variables {i = 1,2,..., TV) and M clauses (a = 1,2,..., Af). Each variable i has a 
binary state ai = ±1, and each clause a represents a constraint which involves a subset da = {ii, i2, ■ - - , ik} of the N 
variables whose size \da\ = K- The energy of a spin configuration a = {cti, (72, . . . , ctn} is defined as 

M 

a=l 

with the energy of clause a being 

Ea=X{'-^- (2) 

In Eq. ([2]), is the recommended spin value to variable i € 9a by clause a. If at least one of the variables i £ da 
takes the recommended value Ui = Jl^, then Ea = and clause a is said to be satisfied. Clause a is violated and 
Ea = \ ii no variables i G da takes the recommended value J*. 

In constructing a random i^T-SAT formula, the K variables in the set da of each clause a are randomly chosen 
from the N variables, and for each variable i G da the preferred spin value is set to be -1-1 or —1 with equal 
probability. Given a random if-SAT formula one needs to find, among the total number of 2^ configurations, at least 
one configuration a that satisfies all the clauses {E{a) =0), or to prove that no such satisfying configurations exist. 
The random if-SAT problem is an extensively studied model in the context of typical-case computation complexity. 
When the number of variables N is very large, a random if-SAT formula has a high probability to be satisfiable 
(respectively, unsatisfiable) if the constraint density a = M/N is less (respectively, greater) than certain threshold 
value as{K)- For K = 'i, Kirkpatrick and Selman estimated Ug — 4.2 through extensive numerical simulations [T], 
while the mean-field theory of statistical physics predicts that as(3) = 4.2667 [U [20]. The theory of Mezard and 
co-authors |4j is also applicable to the random if-SAT problem with > 4. 

There are many algorithms for finding solutions for a random if-SAT formula. A widely used one is the branch 
and bound DPLL algorithm of Davis, Putnam, Logemann, and Loveland (one can refer to the review article |21j 
for the major developments on the DPLL algorithm). Frieze and Suen 22 (see also [22 and the review paper |5^ ) 
showed that for random 3-SAT with constraint density a < 3.003, the generalized unit clause heuristic has a high 
probability of constructing a solution in a single descent of the DPLL tree. But for a > 3.003, backtracking is needed 
and this may result in an exponential increase of computation time |23] . In the case of a > 3, many stochastic search 
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FIG. 1: Logarithm of the search time r{a) by SEQSAT to satisfy the first aN clauses of a random 3-SAT formula of A'' = 10^ 
variables. The solid lines are results obtained from 16 simulation trajectories from different initial spin configurations (half of 
the trajectories are generated by an unbiased random walk rule, while the remaining half are by a biased random walk rule). 
The red dashed line corresponds to the jamming transition point a°° = 4.189724 as predicted by the long-range frustration 
mean- field theory [T7], the green dotted line marks the cluster transition point ad [B], and the right-most black solid line marks 
the satisfiability threshold as [71|8]. 

algorithms are able to outperform the DPLL algorithm in average computation time, examples are WALKSAT [53] (an 
enhanced version of RANDOMWALKSAT [26"), ASAT [27], CHAINSAT [19], belief-propagation [28ll29] and its variants (such 
as reinforcement [30]), and SP [7, 8J. 

Mean-field theory of statistical physics [B] predicted that the solution space of a large random iiT-SAT formula 
experiences several structural transitions as the constraint density a increases. At the clustering transition point 
a — ad, the solution space of the formula splits into an exponential number of Gibbs states of equal importance. 
Later at the condensation transition point a = ac the solution space becomes dominated by only a few Gibbs states 
(for the special case of iC = 3, the clustering and the condensation transition coincide). It was also found that some 
variables will become frozen to the same spin value in all the solutions of a Gibbs state [HI1I12], this fact has serious 
consequences for stochastic local search algorithms. 

III. RANDOM WALK PROCESSES 

Several slightly different random walk processes were exploited in Refs. [T5j|33] to study the solution space structure 
of single random K-SAT formulas. Here, based on the same idea of random walking, we present a simple stochastic 
solver SEQSAT for the random K-SAT problem. Given a random K-SAT formula F with N variables and M clauses 
with index a = 1, 2, . . . , M, we denote by Fm the subformula which contains the N variables and the first m clauses. 
The constraint density of this subformula is a = m/N. We set m = at time r = and satisfy this clause-free 
subformula Fq by randomly picking a configuration (t(0) from the whole set of 2^ configurations. Suppose the m-th 
clause is first satisfied at time T(m), and the configuration at this time is CT(r(m)) . We then add the {m-\- l)-th clause, 
and if this clause is satisfied by a(T{m)), we set T{m-\- 1) — T{m) and (T(r(m-|- 1)) = a(^T{m)); otherwise we perform 
an unbiased random walk of single-spin flips starting from (j(T{'m)^ in the solution space of subformula F^ |34l until 
a configuration (T(r(m+ 1)) that also satisfies the (m+ l)-th clause is first reached at time T(m+ 1) = t(to) +A/'m+i, 
with N'm+i X N being the total number of spin flips used to reach d{T{m + 1)) from the solution d{T{rn)). The 
waiting time of satisfying the (m -I- l)-th clause is identifled to be Mm+i- 

Starting from a random initial configuration ct(0), SEQSAT satisfies the clauses of a iC-SAT formula F in a sequential 
order without overcoming any energy barriers. Simulation results of the next section demonstrate that such a simple 
steepest-descent algorithm actually has very good performance for single random _fir-SAT formulas. There are various 
possibilities for further improving SEQSAT. One extension is to introduce a small positive temperature T into the search 
process. A proposed spin flip is accepted with probability exp{—AE/T), where AE is the change in the configuration 
energy due to this spin flip. At flnite temperatures, the random walker can overcome not only solution space entropic 
barriers but also energetic barriers, therefore its dynamics will be even richer. Another extension, which we explore 
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FIG. 2: Same as Fig. [T] but for a random 4-SAT formula of A'' = 10^ variables. 

in this paper, is to use biased random walk when travehng from one solution to a nearest-neighboring one. 

The following biased random walk jumping scheme is used. Suppose in a configuration (?(<) at time t there are 
n flippable variables. We divide these n variables into two sets A and B: A contains all the variables that are not 
yet flipped since being flippable for the last time, and B contains all the flippable and flipped variables. If set A is 
non-empty, a variable is uniformly randomly chosen from A and its spin value is flipped, otherwise, a variable in set B 
is uniformly randomly chosen and flipped. Once a variable becomes flippable (because all its nearest-neighbor clauses 
are being satisfled by other variables) , it has a larger probability to be flipped under the biased random walk rule as 
compared with the previously mentioned unbiased rule. This gives the random walker a preference to explore new 
regions of the solution space. As we show in the next section, however, this change of local search rule does not bring 
in qualitative improvement in search performance. 

IV. SIMULATION RESULTS AND INTERPRETATION 
A. The case of if = 3 

The time T(a) needed by the unbiased SEQSAT random walk algorithm to satisfy sequentially the first aiV clauses 
of a random i^-SAT formula with N = 10^ variables is shown in Fig. [l]for K — 3. When the constraint density a 
of the satisfied subformula is very low, SEQSAT satisfy every newly added clause almost immediately; as more clauses 
are added into the satisfied subformula, SEQSAT has to make some local adjustments of the spin configuration to 
satisfy a new clause, which takes some time, but the dynamics is still very efficient. However as the constraint density 
of the subformula is beyond a « 3.8, the unbiased SEQSAT slows down considerably and then essentially stops to 
satisfy the newly added clause as a becomes even larger. For example, at a « 4.1 it can take several weeks for the 
unbiased SEQSAT to satisfy a new clause. As a comparison, the satisflability threshold of the random 3-SAT problem 
is as ~ 4.27 (7> i20j. 

We have observed the same dynamic behavior as shown in Fig. [T| when performing the same unbiased SEQSAT 
simulation on different random 3-SAT formulas. The search time T(a) of satisfying the first aN clauses is found to 
scale linearly with the number N of variables when N > 10^. Then for random 3-SAT formulas with different values 
of N, the curves of ln(T/A^) as a function of a can be superimposed onto each other. The diverging behavior of T{a) 
for large values of constraint density a suggests that walking within a solution cluster of the subformula becomes 
more and more viscous as a increases. According to our opinion, the main reason of this viscosity increase is the 
emergence of complex community structure in the solution cluster of the subformula Fm as revealed by Ref. [TS] (see 
Fig.[5|. To satisfy the newly added {m + l)-th clause, at least one of the attached K variables of the clause should be 
flipped. However, in order to make the necessary configuration rearrangement so that one of these K variables can 
finally be flipped, SEQSAT may have to travel through many local communities of solutions, each of them trapping the 
random walk process for a period of time. Such multiple trappings will make flipping a given variable a very difficult 
task. 

The whole solution space of a if-SAT formula can be represented as a graph in which each vertex denotes a solution 
and each edge between a pair of vertices means the two corresponding solutions are related by a single-spin flip [T3] . A 
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FIG. 3: Same as Fig. [l] but for a random 5-SAT formula of = 10^ variables. 

connected component (a solution cluster) of such a graph contains all the solutions that are reachable from a reference 
solution by a sequence of single-spin flips. In the SEQSAT process, after a solution <j that satisfies the first m clauses of 
the random iC-SAT formula F is reached, the (m-l- l)-th clause is added and SEQSAT performs a random walk of single 
spin flips starting from a in the solution cluster of subformula Fm until a conflguration a' which also satisfles the 
newly added clause is reached (i.e., the subformula F^+i is now satisfied). Both a and a' belong to the same solution 
cluster (say Cm) of subformula F^- As SEQSAT always tries to satisfy a newly added clause within the solution cluster 
of the satisfied old subformula, the solution cluster C,„+i of subformula -Fm+i reached by SEQSAT is a subset of C,„, 
i.e., 

Co ^ C*! ^ • ■ • ^ Cm ^ Cm^\ Z) . . . . (3) 

According to the numerical studies of Refs. [TSj [33], the connection pattern of a single solution cluster Cm of a 
random if -SAT subformula may be quite heterogeneous. Some of the solutions may be densely inter-connected 
with each other but are only very sparsely connected to the other solutions of the same cluster. These solutions then 
form a solution community (Fig. |5| . Different communities of the same solution cluster are linked together by inter- 
community edges and/or single solutions that lie at the borders of several different communities. In a community-rich 
solution cluster Cm of subformula Fm, some of the communities (e.g., community A in Fig. |5]) may only contain 
solutions that do not satisfy the {m+ l)-th clause of formula F. If SEQSAT starts unfortunately from a solution of such 
a community A, it then has to waste some time wondering along the internal edges of community A until it finally 
jumps onto an edge that leads to another different community B. (Even if the community A contains some solutions 
that satisfy the {m+ l)-th clause, if the fraction of such solutions in community A is very small, SEQSAT still will take 
time to reach them.) 

If the above-mentioned entropy trapping effect of solution communities is the major reason for the slowing down of 
SEQSAT, then the constraint density value a at which r(a) begins to increases rapidly should be close to the value of a 
at which community structure begins to appear in a single solution cluster of a random if-SAT formula. When running 
SEQSAT on random ii'-SAT formulas, we can define an empirical threshold value acm as the first constraint density at 
which SEQSAT takes Afcm time units (i.e., J^cm x N single spin fiips) or more to satisfy the next clause. Of cause acm is 
a random variable that takes slightly different values in different trajectories of SEQSAT on the same formula. For the 
eight unbiased random walk trajectories shown in Fig. [T] we find that Ucm ~ 3.37 if we set Afcm = 100 and acm ~ 3.82 
if we set Afcm = 1000. These results are consistent with the prediction of Ref. |3S] that the solution space of the 
random 3-SAT problem is heterogeneous and community-rich at constraint density a > 3.75. The heterogeneity of 
the solution space of the random 3-SAT problem causes the slowing down of the unbiased SEQSAT local search process. 

A clustering transition occurs in the solution space of the random 3-SAT problem at a = = 3.87, with the 
solution space breaks into exponentially many solution clusters . As shown in Fig. [l| when a > ad the unbiased 
SEQSAT process is still able to find solutions for a random 3-SAT formula. The solution space ergodicity-breaking 
transition therefore does not lead to divergence of the search time of SEQSAT. Similar phenomena were observed in 
previous studies [TSl US] ■ 
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FIG. 4: Same as Fig. [l] but for a random 6-SAT of = 10^ variables. 
B. The cases of K > A 

The simulation results of the unbiased SEQSAT process for random iiT-SAT formulas are shown in Fig. [2] (K — 4), 
Fig. [3] (K = 5) and Fig. |4] {K = 6). Similar to the results obtained for the random 3-SAT formula, as the constraint 
density a becomes large the SEQSAT process slows down exceedingly in the cases of K > i. Compared with Fig. [l]of 
the K = 3 case, the main difference for K > 4 is that the search time of the unbiased SEQSAT appears to diverge at 
the solution space clustering transition point a — a^. 

This difference may be understood by recalling that at and after the clustering transition point ad, the solution 
space of a random 3-SAT formula is still dominated by a few largest clusters, while that of a random if-SAT {K > 4) 
formula is dominated by an exponential number of relatively small clusters [B] . The spin values of a finite fraction of 
the variables may be frozen in each of these small clusters of a random iiT-SAT formula {K > 4) , and consequently a 
newly added clause has a large probability to be unsatisfied by all the solutions in the solution cluster. 

C. Comparing biased and unbiased random walk search processes 

Figures [T]|4] also compare the performance of the biased random walk search process with that of the unbiased 
random walk search process. For random 3- and 4-SAT formulas, the biased SEQSAT process is more efficient than the 
unbiased process, but both processes appear to diverge at the same critical constraint density values. For random 5- 
and 6-SAT formulas, on the other hand, the divergence point of the biased SEQSAT process is smaller than that of the 
unbiased process. 

V. LONG-RANGE FRUSTRATION THEORY ON THE JAMMING TRANSITION 

The simulation results of the preceding section demonstrate that, the random walk searching process SEQSAT, 
being prohibited from jumping between different solution clusters of a K-SAT subformula Fm , will eventually reach 
a jammed state after the first rumax = ctj^ clauses of the original random formula F have been satisfied. SEQSAT is 
unable to satisfy the {rrimax + l)-th clause if in the reached solution cluster of F,^^^^, the variables that are involved 
in this clause are all frozen to the 'wrong' spin value. The critical constraint density Uj — mmax/N (the jamming 
transition point) may be different in different runs of SEQSAT, as different runs of SEQSAT may reach different solution 
clusters of the solution space. Our simulation results reveal that the jamming constraint densities aj as obtained for 
many trajectories of SEQSAT on the same random ii'-SAT formula and on different random iiT-SAT formulas are very 
close to each other. It is anticipated that, in the thermodynamic limit of ^ 00, the SEQSAT process has a true 
jamming transition at a critical constraint density a°°. 

The jamming transition of SEQSAT is closely related to the freezing of variables in a solution cluster of a random 
if-SAT formula. Recently there were several interesting studies on the freezing transition of the solution space of the 
random A'-SAT problem |31l 1321 1361 137] . For example, the freezing transition point for the random 3-SAT problem is 




FIG. 5: Schematic diagram for the connection pattern of a single solution cluster Cm of a random Jf-SAT subformula Fm 
(containing the first m clauses of a larger formula F). Circles represent solutions, and edges represent a single-spin flips between 
two solutions of unit Hamming distance |15j . Those solutions that do not satisfy the (m + l)-th clause of formula F are denoted 
by filled circles, while the remaining solutions of Cm are all denoted by empty circles. Solutions in cluster Cm are grouped 
into many communities (A, B, C, etc.) such that a solution of a community is connected to many other solutions of the same 
community but are not or only very sparsely connected to solutions of other communities. 

located at a = a/ = 4.254 |32j, while that for the random 4-SAT problem is located a,t aj — 9.88 [3T1. These threshold 
constraint densities correspond to the appearance of frozen variables in the dominating Gibbs states of the solution 
space. On the other hand, the final solution clusters reached by the SEQSAT process should not be the dominating 
clusters of the solution space (otherwise, the {m + l)-th clause would be satisfiable). Therefore it is natural to expect 
(and is confirmed by simulation results shown in Figs. |l]and[2j) that the jamming transition point a°° of SEQSAT is 
less than the freezing transition point af. 

the jamming transition point aj for random iiT-SAT formulas with K > A are very close to the 
critical value ad- On the other hand, for random 3-SAT formulas, aj is much larger than aj,- We now try to predict 
the value of a°° as a function of K using the long-range frustration theory of Refs. \W, TT . In a solution cluster of a 
random i^-SAT formula with N —i- oo variables and constraint density a, some of the variables are unfrozen as their 
spin values are positive in some of the solutions of this cluster and negative in the remaining variables. We denote the 
fraction of unfrozen variables in this solution cluster as go- With respective to a pre-specified spin value a* , a unfrozen 
variable i can be regarded as either type-I unfrozen or type-II unfrozen [17j . If flipping variable i to Gi — a* leads to 
the fixation of the spin values of a finite fraction of all the other unfrozen variables, then variable i is type-I unfrozen; 
if setting ai = a* only affects a small number of other unfrozen variables, then i is type-II unfrozen. We denote by R 
the probability that a randomly chosen unfrozen variable i is type-I unfrozen with respective to a randomly specified 
spin value a*. 

The spin states of the type-I unfrozen variables of the random A'-SAT formula are strongly correlated. If one 
fix the spin of one such variable, the final effect might be that the spins of a large fraction of all the other type-I 
unfrozen variables are also being fixed. A type-I unfrozen variable takes different spin values in different dominating 
communities, but within each community its spin is frozen to one value. 

As one adds more clauses to the random ii'-SAT formula, the solution cluster shrinks, and then both qo and R will 
change. According to Ref. fT7^ the following set of self-consistent equations can be derived: 

oo 

go - ^(P„(m))^ (4) 

R = 1- exp(-A3i?) . (5) 
In the above two equations, the function Py(m) is defined by 



As shown in Figs. 2|4 



m 

P„(m) = ^/(n,A2)P/(m-n) 

n=0 



(6) 
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with 

oo 

Pf{n) = fi2n,X,)C-,2-'-+ /(s, Al)C^2l~^ 

s=2n+l 

/(n,A) = e-^A"/n! , 
and the three A parameters are expressed as 

Ai = Ka{{{qoR+l-qo)/2)^-'-{{l-qo)/2)''-') , 

A3 - {K{K-l)aqo{l-R)/2){{l-qo)/2)''-^ . 



K 


aj 


ad [6 


as '20] 


9^ 


3 


4.1897 


3.87 


4.2667 


0.5270 
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9.2653 


9.38 


9.931 


0.3994 
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19.1480 


19.16 


21.117 


0.3359 


6 


39.0269 


36.53 


43.37 


0.2967 


7 


79.4245 




87.79 


0.2694 
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161.78 






0.2479 


9 


329.704 






0.230 


10 


671.796 






0.2147 


11 


1368.01 






0.2015 


12 


2783.8 






0.190 



TABLE I: is the threshold value of jamming transition as predicted by the long-range frustration theory [T7], ad is the 
clustering transition point reported by Ref. [6], Os is the satisfiability transition point 20 , and is the fraction of unfrozen 
variables at the jamming transition as predicted by the long-frustration frustration theory. 



The jamming transition point af corresponds to the smallest value of a at which a fixed point 50 < 1 of Eqs. (4| 
and (jsj) first appears. Table |l]lists the value of a°° for 3 > X < 12 and the corresponding fraction of unfrozen variables 
9q at the jamming transition. As a comparison with simulation results, we have denoted by a red dashed line the 
predicted jamming transition point af in Figs.ITlHl For if = 3, we find = 4.19, which is larger than the clustering 



transition point ad — 3.87 but is in agreement 'with the simulation results of Fig. [T| while for iiT = 4, a°° = 9.27 is 
smaller than the dynamic transition aa — 9.38. As Fig. [2] shows, for random 4-SAT formulas the SEQSAT process is 
able to reach constraint density values higher than . The long-range frustration mean-field theory appears to give 
a satisfactory prediction of the jamming transition point of SEQSAT for random 3-SAT formulas but fails in the case 
oi K = A and K = 6. 

The solution space structure of the random 3-SAT problem is qualitatively different from that of the random K- 
SAT problem with K > A [Q . Beyond the clustering transition point ad, the solution space of a large random 3-SAT 
formula is dominated by only a sub-exponential number of solution clusters, while that of a large random if -SAT 
{K > 4) is divided into an exponential number of solution clusters of equal statistical importance. The full-step 
replica-symmetry-breaking mean-field theory is needed to fully describe the statistical property of the solution space 
of the random 3-SAT problem a > adii) [33 1 but for the random K-SAT problems with K > 4, a simpler first-step 
replica-symmetry-breaking theory is believed to be sufficient. As the solution space of a random if -SAT {K > 4) 
formula has exponentially many communities or clusters at the vicinity of the clustering transition ad, some of the 
assumptions of the long-range frustration mean-field theory may no longer be appropriate. 

Figure [6] shows how the predicted jamming transition point scales with K. The data is consistent with 

(if) -2^1n2 + 0(l) . (7) 

Notice that the satisfiability threshold as (if) also the same scaling behavior [20]. The jamming value a°°{K) may 
serve as a good lower bound for the satisfiability threshold of the random if-SAT problem. 
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FIG. 6: The scaling behavior of the predicted jamming transition point q°° with K. The red dashed line is a fitting curve of 
the form a°°{K) — 2^ ln2 — co, with the fitting parameter being cq — —1.5 ± 0.2. 



VI. CONCLUSION AND FURTHER DISCUSSIONS 



In this work the dynamic behavior of a simple stochastic search algorithm SEQSAT for the random A'-SAT problem 
were investigated by computer simulations. This simple algorithm is able to find solutions for a large random K- 
SAT {K > 3) formula if the constraint density a is less than certain threshold aj{K), but it experiences a jamming 
transition as a approaches aj{K) from below. For K > A, we found that the jamming point aj{K) is very close to the 
solution space clustering transition point a^^K), but the jamming point aj(3) ~ 4.19 for the special case of K — S 
exceeds ad{3) — 3.87 considerably. We argued in this work that, the dramatic slowing down of SEQSAT at a close to 
the jamming point aj{K) is caused by the entropic trapping effect of various solution communities in a single solution 
cluster of the random K-SAT formula. We also estimated the jamming transition point aj{K) using the mean-field 
long-range frustration theory of Refs. [121 HZ], and found that the calculated value of aj(3) is in good agreement with 
simulation results. 

The rapid increase of the search time T{a) of the dynamic process SEQSAT at a close to the jamming point aj is 
reminiscent of the rapid increase of viscosity of a glass-forming liquid at low temperatures. These glassy behaviors 
may be governed to a large extent by the same physical mechanisms. The random if-SAT problem might serve 
as a very rich model system to study the connection between complex energy landscapes and glassy dynamics. In 
the present paper, the dynamics of SEQSAT is confined to a single connected component of the zero-energy ground- 
state configuration space of a if -SAT formula; in future studies, one may introduce external fields and/or a finite 
temperature to the system to observe more complex dynamic behaviors. 

When the constraint density a of the random if -SAT formula is slightly beyond the jamming point aj, the formula 
still contains exponentially many solutions, but SEQSAT is unable to reach any one of them. On the algorithmic 
side, a major limitation of SEQSAT is that it only explores a single connected component of the solution space. 
One may incorporate the energy-barrier crossing techniques of other heuristic algorithms into SEQSAT to enhance its 
performance. We also demonstrated that the efficiency of searching within a solution cluster can be elevated to some 
extent by using biased random walks [HI |33! , but such a small change of local search rule does not lead to a shift of 
the jamming point aj{K) to larger values. 

Although the mean-field long-range frustration theory |17 l is able to give good predictions on the jamming transition 
point ctj{K) for K — 3, it fails to do so for if = 4,6. For if > 4 the jamming transition point of SEQSAT probably 
is identical to the clustering transition point ad- There is an important uncontrolled approximation in the mean- 
field theory, namely that two type-I unfrozen variables have probability one-half of being prohibited from taking 
simultaneously their canalizing spin values [TBI I17j . This approximation may not be very appropriate for the case of 
if > 3. On the hand, as shown in Table ^ the predicted jamming transition point aj{K), whose value is close to 
ad{K), is always lower than the satisfiability threshold as{K) and has the same scaling behavior of ctj{K) « 2^ ln(2) 
as as(if). 
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